Quantitative approach to facilitated diffusion with intersegmental jumping 
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We study the impact of DNA coiling on the search rate of proteins moving along the DNA 
contour interspersed by three-dimensional (3D) bulk excursions. When the DNA is coiled proteins 
performing short 3D hops along a DNA segment can be captured by foreign DNA segments that 
have looped back close to the original segment. These intersegmental jumps are shown to enhance 
the search rate for a specific site on the DNA by lowering the tendency to resample previously 
visited sites on the DNA. The model developed here offers a quantitative description of recent single 
molecule experiments on facilitated diffusion of restriction enzymes EcoRV. 
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A single stochastic search mechanism is usually not ef- 
ficient at optimizing a search process. Thus Brownian 
motion, whose fractal dimension is dj = 2, in ID and 2D 
leads to significant oversampling in the sense that a given 
site is revisited many times [l|, . Levy motion with a 
jump length distribution \{x) ~ (0 < a < 2) 

has df = a and thus a comparatively reduced degree of 
oversampling. However, the first arrival of Levy motion 
is hampered by leapovers, i.e., frequent overshoots of the 
target In 3D both Brownian and Levy motions are 
inefficient at locating a small target. While typical chem- 
ical reactants in a liquid environment have no choice but 
to passively diffuse until mutual encounter [4] intermit- 
tent models combine different search mechanisms [1, |3] . 

An important example for intermittent search is facil- 
itated diffusion of DNA binding proteins searching for 
their specific target binding site on a DNA molecule. In 
the Berg- von Hippel model ,6] proteins intermittently dif- 
fuse in 3D bulk or along the ID DNA chain 0] , the latter 
mediated by non-specific binding (NSB) 0. 

Previous approaches to facilitated protein diffusion 
consider bulk exchange for a straight cylindrical chain 
they consider scahng properties in different 

chain configurations 11| without quantitative compar- 
ison of the search rates between these; or they refor- 
mulate the problem with uncorrelated bulk excursions 
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15l | . Here we relate a multiparticle picture to 
single protein jumps along the DNA contour allowing us 
to consider DNA as a fluctuating chain. Comparison to 
recent single DNA measurements 16| demonstrates that 
our model quantitatively describes the effect of DNA coil- 
ing. 

General model. In our description of the target search 
process, we use the density per length n{x,t) of proteins 
on the DNA as the relevant dynamical quantity {x is the 



distance along the DNA contour). We include ID diffu- 
sion along the DNA with diffusivity -Did, protein disso- 
ciation with rate fc"^ and (re) adsorption after diffusion 
through the bulk fluid with diffusion constant D^^. The 
dynamics of n{x, t) is thus governed by 
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dx' dt' Whum{x - x',t-t')n{x',t'). (1) 
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Here, j{t) is the flux into the target site located at a; = 0, 
Wbuik(a; — x',t — t') is the joint probability that a pro- 
tein returns to the point x at time t after leaving the 
DNA at x' , t' for a bulk excursion. G{x,t) is the flux 
onto the DNA of proteins that have not previously been 
bound to the DNA. j{t) is determined by imposing the 
boundary condition n(x = 0, t) = at the target. To con- 
sider Wbuik as homogeneous in space and time we made 
two assumptions that we found to be valid for the sys- 
tem studied in [16]. First, that the DNA can be treated 
as being infinitely long when considering the search rate 
17| . Second, that the coiled DNA conformation fluc- 
tuates quickly such that subsequent excursion distances 
X — x' can be treated as independent. 

To proceed, we Laplace and Fourier transform Eq. (fl]): 

n{q, u)/W{q, u) - n„{q) = G{q, u) - j{u) (2) 

with n{q, u) ~ J^{n{q, i)}; n{q, t) = ^{n{x, t)}; no{x) = 
n{x, t — 0), and 

W{q,u) - l/{u + Diiq^ + k'^[l - I^bulk(9,u)]}. (3) 

We rewrite these equations in the form 

n{q, u) = n''%q, u) - j{u)W{q, u) (4) 
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where the density n^^{q, u) = W{q, u)[G{q, u) + no{q)] is 
the solution of the diffusion problem in absence of the 
target, i.e., when only NSB between proteins and the 
DNA occurs. In position space at a; = one obtains 
j{u) = = 0,u)/W{x = 0,u). 

In this study we focus on the long time behavior of 
j{t) for which the density n'^^{x,t) will reach an equi- 
librium value rigq. Since, by Tauberian theorems, large 
t correspond to small u we study the u ^ behavior 
of j{u), with n'^^{x,u) ^ n^q/u- For Wo{x = 0,m) we 
assume that the limit W{x = 0, u = 0) is finite and 
non-zero; this is true when the distance distribution of 
bulk excursions is sufficiently long tailed as fulfilled for 
the VFbuik encountered below. In practice, being inter- 
ested in the first arrival at the target, we assume that 
the protein concentration is sufficiently dilute such that 
this arrival happens after reaching n^^ ('rapid equilib- 
rium') but still with many searching proteins. Thus 
j(u) ~ fcidHgq/u, and therefore at long times the sta- 
tionary current jstat ^ ki^n^^ into the target is reached. 



Here we introduced k 
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W{x = 0,u = 0): 
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27r Didq^ + A;off(l - Abuikl?)) ' 



(5) 



Abuik(2:) = Wbuik(a;;W ~ 0) is the distribution of reloca- 
tion lengths along the DNA after a single 3D excursion. 
We express our results in terms of the association rate 



^on — Jstat /^bulk 



(6) 



to the target where nbuik is the density of unbound pro- 
teins in the bulk (at non-specific equilibrium). The con- 



stant of NSB per length of DNA is 



"oq/'^bull 



In terms of the total volume concentration of proteins 



"total — rihulk 



total 



DNA^eq 



, where Z^na is the total length 



of DNA divided by the volume of the entire system, we 
have Jstat ~ A;onntotai/(l + K^sl^^^j,)- The rate fcon is 
related to the mean first arrival time for the steady state 
association. Namely fcon"-buik is the probability per time 
for protein association with the target. Thus the (sur- 
vival) probability that no protein has arrived at the tar- 
get yet is Psurv(i) — exp(— fcon'T'buiki), and the average 
target search time is T = l/(fcon''^buik)- The flux j(t) 
being linear in the protein concentration "total implies 
that for sufficiently low "total the steady state is reached 
well before the first protein actually binds to the specific 
target (note that in vivo protein concentrations can be 
as low as nanomolar). 

Straight rod configuration (Fig. Qp. Consider first 
stretched DNA corresponding to a cylinder of radius nnt 
(the range of non-specific interaction), with NSB reaction 
rate ft^^ at the boundary: the flux of proteins, per length 
of DNA, onto the DNA is fc^^ times the bulk concentra- 
tion next to the DNA. This implies that at equilibrium 
of NSB, <^fc^| = nbuik^i^^ such that K^, = kZ/k^'g \M- 
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FIG. 1: Schematic of the search mechanisms in Eq. ([T]) on 
a piece of stretched DNA. Note the tendency to revisit sites 
when immediate rebindings and thus short hops are likely. 



To find the distribution Whuikix,t) for return to the 
DNA of a protein released at the DNA at a; = and 
t = 0, we solve the cylindrical diffusion equation for the 
probability density P{x, r, t) of the protein's position 



dP 
'dt 
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dx^ 
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r dr dr 



P 
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where r measures the distance to the cylinder axis. The 
reactive boundary condition for the probability flux out 
of the cylinder boundary is 



27rri„t£»3d 



dP 
dr 



kZP\r=r.,.+m5{x) 



(8) 



The initial distribution outside the cylinder is 



P{r,t)\^ 



0. From the solution of this problem 



we obtain PVbuik as /'L_^ in terms of modified 
Bessel functions [with q = \/q'^ + u/D^d] 

2TrD3dqrintKi{qrint 



<u1k(9,^) = (1 



k^Koiqr^^t) 



(9) 



With these ingredients one can evaluate numerically 
the search rate to reach the specific target on a straight 
DNA, as given by Eq. ((S]). However, some limits allow 
analytic approximation for the integral in Eq. ([5]): for 
<^ I?3d one may take 1 — Abuik(9) ~ 1, and thus 

,-1 rr^ 



"Id 



(2-^£)idfc"^) . One finds the association rate 



(10) 



where Igi — \/Did/k'^g is the so-called sliding length of a 
protein during a single NSB period. Eq. pO|) states that 
each time a protein, with rate constant /Cq,j, binds to the 
DNA within a distance Isi from the target it is able to 
"slide" onto the target. Note that in this fc"^ <C D^d 
limit the search rate /con is independent of the DNA con- 
formation. In the opposite limit ^ D^^ one may ap- 
proxim ate 1- Abuik(g ) ~ 27rD3d/[A;on/ ln(^sf /nnt)] where 
If = y/k'^fJ{2'KD3d)lsi- This produces the result [13] 



(11) 



In this limit the protein can rebind quickly after unbind- 
ing. This leads to oversampling and a lowered value of 
fcon compared to Eq. (fTU]) . can be interpreted as an 
effective sliding length including immediate rebindings 
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W^buik in the coiled conformation then becomes 



FIG. 2: Search rate enhancement on coiled DNA: a protein 
that in the case of straight DNA would perform a short hop 
can now, due to DNA looping, be captured by a foreign (re- 
mote) segment of DNA. This leads to a more efficient search 
since local oversampling is reduced by these intersegmental 
jumps [2Q|. The DNA configuration is assumed to be dy- 
namic such that when the protein is released from the foreign 
segment the latter has moved further away from the original 
segment. Dashed lines: motion of the protein; dotted lines: 
DNA configurational changes. 



. Conversely, in the limit k^^^ <C fsd the rate of NSB 
will limit rcbinding such that the protein will relocate by 
a long distance during a single 3D excursion. The part 
of Abuik corresponding to such long relocations do not 
contribute significantly to the integral in Eq. which 
is the physical reason for the approximation leading to 
Eq. (fTO|). Results (O and ^ are identical to the classical 
result for straight, infinitely long DNA However, the 
approach here allows to explicitly consider coiled DNA. 

Coil with persistence length (Fig. dp. To obtain Wbuik 
for a random coil DNA configuration we treat the DNA 
as consisting locally of straight segments. We assume 
that the local density Ibna of DNA (length per volume) 
around a segment is dilute such that the typical distance 
l/VfoNA between segments is much larger than nnt- We 
then apply a superposition technique, where the prob- 
ability that the protein has reacted with a segment is 
obtained by treating independently the interaction with 
each segment, and then choose among these the inter- 
action for which binding happens first. To obtain the 
dynamics of the interaction of the protein with one seg- 
ment of DNA we consider a large volume V around the 
segment of length L. Placing the protein at a uniformly 
random position in space relative to the DNA segment 
at < = and denoting the probability that the pro- 
tein has bound to this DNA before time t by Jsingie(i), 
then the probability that it has not bound to any of 
N pieces of DNA is (using the assumed independent 
interactions) P/°J°'s'^(<) = [1 - Jsi„gic(i)]^- Taking the 
limit of V,N,L — ^ oo for fixed Iuna = NL/V we get 
P/°-'g"(i) = exp [- Jcap(t)], where 



= 1 



h 



2tt y/uDsdnnt I<1 ( \/u/D3drint ) 



(12) 



(13) 



foreiE 



:-(0 



dt 



<u1k(9 = 0,t')di'. 



The first term on the right hand side stands for pro- 
teins that return to the original segment without being 
captured by foreign segments. Those are represented 
by the second term, where the factor ^^,o is a result of 
the assumption that a capture by foreign DNA segments 
leads to a long relocation measured along the DNA con- 
tour. These relocations only contribute close to g = in 
Fourier space. Exploiting the reasoning behind the ap- 
proximation that led to Eq. (jTU]) we discard them from 
W^buik when the integral in Eq. ^ is calculated. How- 
ever, even numerically this integral is difficult to han- 
dle. A good approximation can be found noting that by 
Tauberian theorems the Laplace inversion for Jcap(u) can 
be divided into three regimes when ^ -Dsd: 



Jcapit) = { 4ZDNAniit\/'^-D3di , 



< t < 



\-D3dt 



ln(I33dt/r"„t) 



f ^ liaL 



The first regime corresponds to reaction limited binding 
and is very brief for large fc"^ . We discard this regime and 
ignore the slowly varying log arithm in the last regime, to 
approximate pf°J°'sn [21| 



-Papp(0 = exp -4?DNAnntV^i^^3d^- ^cap^ 



(15) 



The value of /ccap is fixed self-consistcntly by the condi- 
tion that the total amount of proteins returning to the 
original segment is the same as for P/^^'^n^ ^ ^ 

dt WZl{q,t) [P/™(t) - Papp(t)] = 0. (16) 

We solved this equation numerically. But when a value 
for kcap has been obtained then the choice of Papp in Eq. 
psp is very convenient. In fact it can be written 



app 



it) 



ds 



, (17) 



where a — 4-y/7rri„t /dn A V-Dsd • When inserted into Eq. 
(fT5)l in Laplace space we need to evaluate numerically 
only one integral to obtain Wbuik((7, u), namely 



Wbuik(9,w) = 



-a' /(is) 

ds — M/^yi 



W^bulk(9:" + '^ + fccap). 

(18) 

The term with Sq^ was already discarded here. Eq. (fT5)) 
generalizes the straight rod result ^ . In the limit I?3d ^ 
fcojj it is consistent with the approximation 1 — Abuik('z) ~ 
1 leading to the expected result (fTU|) for that case. 
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In above analysis we assumed that the density of for- 
eign segments is uniform in space. Although this is not 
true for a relaxed coil (the density decreases on a scale 
of the persistence length of the DNA) we argue that this 
is not important for evaluating the search rate: in fact, 
if the protein diffuses far away from the original segment 
it will most likely perform a long relocation measured 
along the DNA contour. Again it does not matter for 
the search rate how we bookkeep these long relocations. 
To estimate the density Zdna for a random relaxed coil 
we employ the Worm Like Chain model. A good approx- 
imation for the probability density for a point a contour 
distance s away from another point to loop back is [2^ 



JM 



^7T\s\£p 



3/2 



exp 




(19) 



For a target in the middle of a chain of length L, we find 
the DNA density /dna = /-l/2 around the target. 

The theory presented here was shown in Ref. 
to quantitatively describe recent experimental data ob- 
tained from an optical tweezers setup by which the search 
rate fcon at various degrees of DNA coiling could be 
measured. The relative increase R of fcon in the maxi- 
mally relaxed DNA configuration compared to fcon in the 
stretched state, for DNA length L = 6538 base pairs, was 
measured to be around 1.1 to 1.3 in buffers with salt con- 
centrations of 5 mM MgClj and to 25 mM for NaCl. 
This relative increase R compared well with the theory 
presented here using estimates of Zdna based on Eq. ([19]) 
giving 1/VIdna ~ 500 bp, and a value ~ 100 bp 
based on a quantitative analysis [l^ of experiments jS] 
suggesting effective sliding lengths of this size. Numer- 
ical analysis of the theory presented here showed that 
R depends significantly only on /dna and l^^ for realis- 
tic ranges of the remaining parameters. At physiological 
salt (100 mM NaCl) an increase of R ~ 1.7 was found 
experimentally [1^. This observation agrees well with 
the increased DNA density expected due to attraction 
between segments induced by the 5 mM divalent Mg^'*' 
ions at this NaCl concentration, see Ref. [l^ for details. 
We are not aware of a theory to calculate ?dna under such 
attraction. However, the value 1/a/?dna ^ needed to 
obtain R « 1.7 at this salt conditions appears reasonable. 

We presented a model for facilitated diffusion that is 
useful to calculate the search rate of DNA binding pro- 
teins for their specific binding site on a fluctuating DNA 
coil with variable DNA density. This model represents 
a convenient way to rephrase the Berg-von Hippel the- 
ory with the additional advantage that it interpolates 
between stretched and coiled DNA configurations. In 
the development of this theory a number of assumptions 
were made, in particular that of sufficiently fast local re- 
laxation of the DNA configuration. This assumption was 
found to be satisfied for the protein EcoRV used in [l^ . 
but it will not hold universally. It would therefore be 



beneficial to generalize the theory to slowly fiuctuating 
or static DNA configurations. 
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